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Abstract 

In this paper, we revisit the much studied LCS problem for two given sequences. Based on the 
algorithm of Iliopoulos and Rahman for solving the LCS problem, we have suggested 3 new improved 
algorithms. We hrst reformulate the problem in a very succinct form. The problem LCS is abstracted to 
an abstract data type DS on an ordered positive integer set with a special operation Update{S,x). For 
the two input sequences X and Y of equal length n, the first improved algorithm uses a van Emde Boas 
tree for DS and its time and space complexities are 0(Rloglogn + n) and 0{R), where R is the number 
of matched pairs of the two input sequences. The second algorithm uses a balanced binary search tree 
for DS and its time and space complexities are 0(7? log L + n) and 0(R), where L is the length of the 
longest common subsequence of X and Y. The third algorithm uses an ordered vector for DS and its 
time and space complexities are 0{nL) and 0{R). 


1 Introduction 

The longest common subsequence (LCS) problem is a classic problem in computer science. The problem has 
several applications in many apparently unrelated fields ranging from file comparison, pattern matching and 
computational biology [U [U |H [^. 

Given two sequences X and Y, the longest common subsequence (LCS) problem is to find a subsequence 
of X and Y whose length is the longest among all common subsequences of the two given sequences. 

The classic algorithm to LCS problem is the dynamic programming solution of Wagner and Fischer [15] . 
with O(n^) worst case running time. Masek and Paterson[9] improved this algorithm by using the ”Four- 
Russians” technique to reduce its running time to 0(n^/log n) in the worst case. Since then, there has been 
not much improvement on the time complexity in terms of n found in the literature. However, there were 
several algorithms with time complexities depending on other parameters. For example, Myers in|10] and 
Nakatsu et al. in[TT] presented an 0{nD) algorithm, where the parameter D is the Levenshtein distance of 
the two given sequences. The number of matched pairs of the two input sequences 7?, is perhaps another 
interesting and more relevant parameter for LCS problem. Hunt and Szymanski]^ presented an 0{{R + 
n)logn) time algorithm to solve LCS problem. Their paper also cited applications, where R ^ n and thus 
the algorithm would run in 0(n log n) time for these applications. To the authors’ knowledge, the most 
efficient algorithm sofar for solving the LCS problem is the 0(7? log log n + n) time algorithm presented by 
Iliopoulos and Rahman [6l iTl [8l IT^ . The key point of their algorithm is to solve a restricted dynamic version 
of the Range Maxima Query problem by using some interesting techniques of [12] . and combining them 
with van Emde Boas structure [14]. Readers are referred to[T] for a more comprehensive comparison of the 
well-known algorithms for LCS problem and their behavior in various application. 

In this paper, we will revisit the classic LCS problem for two given sequences and present new algorithms 
with some interesting new observations and some novel ideas. Our main result is an improved 0(7? log log n-|- 
n) time algorithm of Iliopoulos and Rahman. Our novel algorithm has a very simple structure. It is very 
easy to implement and thus very practical. 

The organization of the paper is as follows. 
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In the following 4 sections, we describe our improved 0{R log log n + n) time algorithm of Iliopoulos and 
Rahman for solving LCS problem. 

In Section 2, the preliminary knowledge for presenting our algorithm for LCS problem is discussed, and 
the 0(i? log log n + n) time algorithm of Iliopoulos and Rahman is reviewed briefly. In Section 3, we present 
our improvements on the algorithm of Iliopoulos and Rahman with time complexity 0(i? log log n + n), where 
n and R are the lengths of the two given input strings, and the number of matched pairs of the two input 
sequences, respectively. Some concluding remarks are located in Section 4. 


2 An 0(i? log log n + n) time Algorithm 

In this section, we briefly review the 0(i? log log n + n) time algorithm of Iliopoulos and Rahman[5] for the 
sake of completeness. 

A sequence is a string of characters over an alphabet ^ subsequence of a sequence X is obtained 
by deleting zero or more characters from X (not necessarily contiguous). A substring of a sequence X is a 
subsequence of successive characters within X. 

For a given sequence X = xiX 2 ■ • ■ Xn of length n, the fth character of X is denoted as Xi € 
any i = 1, ■■■ ,n. A substring of X from position i to j can be denoted as X[i : j] = XiXi+i • ■ - Xj. If 
i 1 or j ^ n, then the substring X[i : j] = XiXi+i ■ ■ • Xj is called a proper substring of X. A substring 
X[i : j] = XiXi+i • ■ • Xj is called a prefix or a suffix of Af if z = 1 or j = n, respectively. 

Definiton 1 An appearance of sequence X = xiX 2 ■ ■ ■ Xn in sequence Y = yiy 2 ' ■ ■ Vn, for any X and 
Y, starting at position j is a sequence of strictly increasing indexes zi,Z2,''' An such that ii = j, and 
^ Vil ) yi2 5***5 Vin • 

A match for sequences X and Y is a pair {i,j) such that Xi = yj. The set of all matches, M, is defined 
as follows: 

M = {{i,j)\xt = yj, l<i ,3 <n} 

The total number of matches for X and Y is denoted by R = \M\. 

It is obvious that R < n^. 


Definiton 2 A common subsequence of the two input sequences X = xiX 2 ■ • ■ Xn and Y = yiy 2 ■ • ■ yn, 
denoted cs{X,Y), is a subsequence common to both X and Y. The longest common subsequence of X and 
Y, denoted LCS{X, Y), is a common subsequence whose length is the longest among all common subsequences 
of the two given sequences. The length of LCS{X,Y) is denoted as r{X,Y). 

In this paper, the two given sequences X and Y are assumed to be of equal length. But all the results 
can be easily extended to the case of two sequences of different length. 


Definiton 3 LetT(i,j) denote r{X[l : z],F[I '■ j]), when {i,j) € M. T{i,j) can be formulated as follows: 


ni,j) = 


Undefined if {i,j) ^ M 

1 if ii,j) e Mand {i',j') ^ M,i' < i,j' < j 

maxi<;.<i {T(li,lj)} Otherwise 


( 1 ) 


In the algorithm of Iliopoulos and Rahman[3], a vector H of length n is used to denote the maximum value 
so far of column I of T. For the current value of z G [l-.n], Hi{l) = rnaxi<k<i,{k,i)&M{T{k,l)},l < I < n. 
The footnote i is used to indicate that the current row number is i, and the values oi Hi(l),l < I < n are 
not changed for row i in the algorithm. We can omit the footnote i if the current row is clear. 

The most important function RMQi(left,right) = rnaxieft<i<right{Hi{l)} used in the algorithm is a 
range maxima query on H for the range [left..right], 1 < left < right < n. It is clear that 
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T{i,j) = l + RMQ,{l,j-l), ii{i,j)eM (2) 

For the efficient computation oiT{i,j), the following facts [71 [T^ are utilized in the algorithm of Iliopoulos 
and Rahman. 

Fact 1 Suppose {i,j) G M. Then for all (i',j) G M,i' > i, (resp.{i,j') G M,j' > j), we must have 
> T{i,j) (resp.T(i,j') > T{i,j) ). 

Fact 2 The calculation of the entry T{i,j), (i,j) G M, 1 < i,j < n, is independent of any T{l,q), (l,q) G 
M, I = i,l < j < n. 

The algorithm is proceed in a row by row manner as follows. 


Algorithm 1 Iliopoulos-Rahman-LCS 
1: for i = 1 to n do 

2: H <— S {Update H for the next row} 

3; for all (i,j) G M do 
4: T(^,J)^l + RMQ(l,j-l) 

5: S{j)^T{i,j) 

6: end for 

7: end for 


In the algorithm, another vector S, of length n, is invoked as a temporary vector. After calculating 
T{i,j), the vector S is restored from T, S{j) = T{i,j), if (*, j) G M. Therefore, at the end of processing of 
row i, S is actually The algorithm continues in this way as long as it is in the same row. As soon as it 

comes a new row, the vector H is updated with new values from S. In the algorithm above, RMQ{l,j — 1) 
is actually RMQi{l,j — 1). We can omit the footnote i since the processing is in the same row i. The 
correctness of the above procedure follows from Facts 1 and 2. The problem RMQ can be solved in 0{n) 
preprocessing time and 0(1) time per query)?, [2]. Therefore, for the constant time range maxima query, an 
0{n) preprocessing time has to be paid as soon as H is updated. Due to Fact 2, it is sufficient to perform 
this preprocessing once per row. So, the computational effort added for this preprocessing is O(n^) in total. 

The most complicated part of the algorithm of Iliopoulos and Rahman is in the computation of RMQ{1, j— 
1). To eliminate the term from the running time of the algorithm, a van Emde Boas tree is used to store the 

information in H. This data structure can support the operations Search, Insert, Delete, Min, Max, Sucre, 
and Pred in O(loglogn) time. By using these operations the query RMQ[l,j — 1) can then be answered 
in O(loglogn) time. The Oin) preprocessing step of the algorithm can then be avoided and hence the 
term can be eliminated from the running time. However, as a price to pay, the query time of RMQ{l,j — 1) 
increases to O (log log n). 

Finally, the time complexity is improved to 0(i? log log n + n), using 0(pn?) space. The vEB structure 
described in the algorithm of Iliopoulos and Rahman is somewhat involved. For more details of the data 
structure, the readers are referred to[71 [T2]. 

3 Improvements of the algorithm 

In this section, we will present several improvements on the algorithm of Iliopoulos and Rahman. 

3.1 A first improvement 

In the algorithm of Iliopoulos and Rahman, the set M = {{i,j)\xi = yj,l < i,j < n} is constructed explicitly 
in the lexicographic order of the matches such that all the matches can be processed in the correct order. To 
this purpose, two separate lists. Lx and Ly are built in 0{n) time. For each symbol c G Lx{c) (resp. 
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Ly{c)) stores, in sorted order, the positions of c in X (resp. Y), if any. Then, a van Emde Boas structure 
is used to build the set M in 0(i? log log n) time. This preprocessing step can be removed completely from 
our improved algorithm, since we do not need to access each match (i, j) directly. It will be clear in the 
description of the algorithm in latter section. 

Secondly, to avoid overlap, the information in H are stored separately in row i and i + 1. We have noticed 
that, the only query on H used in the algorithm is of the form RMQ{l,j — 1) = maxi<i^j{H{l)}. It is 
obvious that if in each row i, we process all j G Lyixi) in a decreasing order, i.e. the order of j from large to 
small, then every overlap for current j will not change the values RMQ{l,j — 1) for the succeed j. Therefore, 
the information in H can be updated in same row immediately. In our improved algorithm, the list Ly is 
built in this way such that for each symbol c G the positions of c in F are listed from large to small. 

3.2 The key improvements 

By the definition of Hi{l) = rnaxi<k<iXk,i)€M{T{k, /)}, I < Z < n for the current value of * G [l-.n], it is not 
difficult to observe the following fact. 

Fact 3 For all 1 < i' < i < n, and I < J < n, < Hi[j). 

The information in H we maintained is used for computing RMQ(l,j — I) in the algorithm. If we use 
a vector Q of length n to store the values of Q{j) = RMQ{l,j) for all I < j < n, then RMQ{l,j — 1) 
will be computed more directly by Q{j — I) in 0(1) time. For example, if iJ = (0,1, 2,1, 2,0,1), then Q = 
(0,1, 2, 2, 2, 2, 2). An important albeit easily observable fact about Q is that, its components Q{j), 1 < j < n 
are nondecreasing. 

Fact 4 Suppose Q{j) = RMQ{l,j) = maxi<i<j{F[{l)}. Then the values of Q{j),l < j < n form a 
nondecreasing sequence. This sequence is under a very special form. If Q(j) < Q{j + I),l < j < n, 
then Q(j + I) = Q(j) + 1- Therefore, the values of this sequence are taken from a consecutive integer set 
{0,1, • ■ • , L}, where L = r{X, Y) is the length of the longest common subsequence of X and Y. 

Proof. 

For any 1 < j' < j < n, Q{j') = maxi<i<ji{H{1)} and Q{j) = maxi<i<j{H{l)}. It follows from 
j' < j that {Z|l < I < j'} C {Z|I < I < j}. It follows then Q{j') < Q{j). This proves that the values of 
Qij)A ^ j form a nondecreasing sequence. 

In the case of Q{j) < Q{j + 1), I < J < n, since Q{j + 1) and Q{j) are both nonnegative integers, we 
have: 

Q{j + 1) > Q(j) + 1 (3) 

On the other hand, by the definition of Q, if current row number is i, we have: 


Q{j + 1) = i]iax {i?*(Z)} 

= max{ max {Hi{l)}, Hi{j + I)} 
^<l<3 

= max{Q(j),iJ,(j + 1)} 

= + 1 ) 


(4) 


It follows from the definition of Hi{j + I) that 


+ = 4nax {r(fc,j + I)} 

= T{k',j + 1), I < /c' < i, (k',j + I) G M 
= Qk'U) + 1 

It follows from Fact [3] and k' < i that Hk'{j) < and thus Qk'{j) < Qi{j) = Q{j)- 


(5) 
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It follows from (|1|) and ([5]) that 


QU + 1 ) — Hi{j + 1 ) 

= Qk'[j) + l ( 6 ) 

^ Q{j) +1 

Combining ([3]) and (| 6 ]) we then have, Q{j + 1) = Q{j) + 1- 
The proof is completed. □ 

By the definition of Q we know, if current row number is i, then for 1 < j < n. 


QU) = max{i?,( 0 } 

^<1<J 

= max max iT(k. 7 )} 
^<l<j 'i-<k<i,{k,j)GM 

= max {T(k, 0} 

l<k<i 

^<l<j 


(7) 


At the end of the algorithm, all the n rows are treated. At this time, we have, for 1 < j < n. 


QU) = {T{k,l)} (8) 

l<K<n 

l<l<j 

In other words, QU) = ^ j])) 1 < j < "fi- Especially, Q{n) = L = r{X,Y). If we want compute 

L = r{X,Y), but not LCS{X,Y), then we do not need to store the 2 dimensional array T. At the end of 
the algorithm, Q(n) will return the answer. 

Because of its unusual form, Q can be viewed as a piecewise linear function. As we know, it is sufficient 
to record the break points of such functions to calculate their values. Therefore, we can use another vector 
P of length at most L to record the break points of Q. Let m = maxi<i<„{Q(i)}. For 1 < j < n, the value 
of P{j) can be defined as follows: 


Let 


m 


mini<j<„{i|Q(i) = j} if j < to 
n + 1 Otherwise 


(9) 


SU) = PU), if 1 < j < n, I < P{j) < n (10) 

It is clear that S forms an increasing sequence of length at most L. 

For instance, if Q = (0,1, 2, 2, 2, 2, 2), then P — (2, 3, 8 , 8 , 8 , 8 ,8), and S = {2, 3}. 

Let a = I S'!, then S(a) is the maximal element of S. 

By the definition of S, ii k < a, then for any S{k) < j < S{k + 1 ), we have QU) = k. In the case of 
k = a, for any S{a) < j < n, we have QU) = ct- Therefore, QU) can be easily computed by S as follows. 


QU) = 


k if S{k) <j< S{k + 1) 
a j > S{a) 


Furthermore, we can wind up the following fact. 


( 11 ) 


Fact 5 Suppose PU)^ 1 < j < n be defined by formula (0). Then S = {P(j)|l < j n,l < PU) ^ n-} forms 
an increasing sequence of length at most L. This sequence has a very unique property. For each P{t) G S, 
and any P{t) < j < P(t + 1), t = maxi<f,<i{T(fc, Z)}, if current row number is i. At the end of the algorithm, 

i<i<j 

for each P{f) G S, t = r{X,Y[l : j]) for any P(t) < j < P{t + 1). Especially, the maximal element of S is 
P{L), and L = r{X,Y). 
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Proof. 

By the definition of P, we have, for each P{t) € S, if P{t) < j < P{t + 1), then Q{j) = t. Therefore, it 
follows from formula © that if current row number is i, then t = maxi<fe<i{T(fc, 1)} 

i<i<i 

At the end of the algorithm, all the n rows are treated. At this time, for each P{t) € S, t = 
maxi<fc<„{T(fc, 1)} = r{X, Y[1 : j]). 
i<i<i 

Especially, if P{t') is the maximal element of S, then P{t') < n < P{t' + 1) = n + 1, and thus, 
t' = r{X,Y[l : n]) = r{X,Y) = L, i.e., the maximal element of S is P{L)^ and L = r{X,Y). 

The proof is completed. □ 

In the algorithm, if current match {i,j) is processed, then the value of T(i,j) is changed to 1 + Q{j — 1). 
Let Q{j') be the successor of Q{j — 1) in Q, i.e., 

/ = .min {l\Qil) > QU - 1)} ( 12 ) 

j<l<n 

Then Q[j : j' — 1] must be changed to 1 + Q{j — 1), according to the definition of Q. 

Similarly, Let S{k) be the successor of j — 1 in S', i.e., 

^ = min {1|S(0 > j - 1} (13) 

l<i<n 

Then S(fc) must be changed to j, according to the definition of S. 

In the case of j — 1 has no successor in S, i.e., j — 1 > S{a), then j must be added into S. 

For example, if Q = (0,1, 2,2, 2, 2, 2), S = (2,3), and current match (4,6) is processed, then T(4,6) is 
changed to 1 + Q{j — 1) = 3. In this case, j = 6 and j' = 8, and thus Q [6 : 7] must be replaced by 3. The 
current values of Q becomes Q = (0,1,2, 2,2, 3,3). At this point, j — 1 = 5 > S(a) = 3, and thus j must be 
added to S. The current values of S becomes S = (2, 3, 6 ). 

3.3 The improved algorithm 

It is readily seen from the discussions above that if we can use the ordered set S defined by formula mD to 
calculate the function RMQ{l,j — 1), then the algorithm will be simplified substantially. The key point is 
the way to maintain the ordered set S efficiently. 

Let U = {j\l < j < n}. Suppose DS be an abstract data type on an ordered positive integer set S. The 
abstract data type DS can support the following operations on S : 

1. Size{S) 

A query on an ordered positive integer set S that returns the number of integers in S. 

2. Succ{S,x) 

A query that, given a positive integer x whose key is from U, returns the next larger integer in S, or 
0 if a; is the maximum integer in S. 

3. Pred{S,x) 

A query that, given a positive integer x whose key is from U, returns the next smaller integer in S, or 
0 if X is the minimum integer in S. 

4. Update{S,x) 

A modifying operation that, given a positive integer x, if Succ{S,x — l) > 0, then replace Succ{S,x — l) 
with the integer x, otherwise augments the set S with a new integer x. 

With this abstract data type, we can maintain the ordered positive integer set S Q U defined by formula 
OT in our new algorithm LCS as follows. 
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Algorithm 2 LCS 

1 : 5*^0 

2 : for i = 1 to n do 
3: for all j € Lyixi) do 

4: Update{S,j) 

5: end for 

6 : end for 
7: return Size{S) 


In above algorithm LCS, the list Ly{c),c € S stores, in a decreasing order, the positions of c in Y. 
The |S| lists can be constructed in 0{n) time, by simply scanning Y in turn. At the end of the algorithm, 
L = r{X,Y) = Size{S), the length of LCS{X,Y), is returned. 

The efficiency of the new algorithm is depended heavily on the efficiency of the abstract data type DS, 
especially on the efficiency of its operation Update{S, x). 

We have found that the van Emde Boas tree is an elegant data structure for our purpose. Specifically, van 
Emde Boas trees support each of the following dynamic set operations. Search, Insert, Delete, Min, Max, 
Succ, and Pred in O(loglogn) time. The operation Update{S,x) can be easily implemented by combining 
at most two successive operations delete and insert. 

If we chose van Emde Boas tree as our data structure for S, the new algorithm can be described as 
follows. 


Algorithm 3 vEB_LCS 
1: 5*^0 

2 : for i = 1 to n do 
3: for all j € Lyixi) do 

4: fc -S— SuCc{S,j — 1) 

5: if fc < Max{S) then 

6: Deleters, k) 

7: end if 

8 : Insert{S, j) 

9: end for 

10: end for 
11: return Size{S) 


The structure of algorithm vEB — LCS is very simple. Although the algorithm can correctly return the 
length of LCS{X, Y), it does not directly give LCS{X, Y). If we want to compute LCS{X, Y), but not just 
its length, we have to record more information. A commonly used method is to record the predecessor of 
each match {i,j) G M by a two dimensional array like T{i,j). The two dimensional array returned by the 
algorithm allows us to quickly construct an LCS of X and Y. This method requires extra 0{n^) space. 
Eor this purpose, we can design a more efficient method using only 0{R) space. We use two vectors B and 
C, both of length R, to record the predecessor’s match number and the matched character for each match 
(i, j) € M, respectively. The match numbers for all matches {i,j) G M are generated one after another in 
the algorithm. 

In the above algorithm, the vector I? is a temporary vector of length at most L used to store the match 
numbers for current set S. With the two vectors B and C computed in the above algorithm, the following 
recursive algorithm prints out LCS{X, Y). The initial call is Print — LCS{Size{S)). 

It is obvious that Print — LCS{Size{S)) takes time 0{L), since it prints one character of LCS{X, Y) in 
each recursive call. Einally, we can find that the following theorem holds. 

Theorem 1 The algorithm vEB_LCS correctly computes LCS{X,Y) in 0{Rlog\ogn + n) time and 0{R) 
space in the worst case, where R is the total number of matches for X and Y. 
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Algorithm 4 vEB_LCS 
1 : m •(— 0, S' ■<— 0 
2: for I = 1 to n do 
3: for all j G Lyixi) do 

4: k <r- Succ{S,j — 1) 

5: if fc < Max{S) then 

6 : Delete{S,k) 

7: end if 

8 : Insert{S,j) 

9: p^Pred{S,j) 

10: TO ■(— m + 1 

11 : B(rn) ^ D{p),C{'m) ^ j,D{j) ^ m 

12: end for 

13: end for 

14: return Size{S) 


Algorithm 5 Print-LCS{k) 
1: if /c < 0 then 
2: return 

3: else 

4: Print — LCS{B{k)) 

5: print Y{C{k)) 

6: end if 


Proof. It follows from formulas (EH) and m that the values of RMQ{l,j — 1) for each row i can be correctly 
computed by the set S maintained in the algorithm. Therefore, the algorithm can compute LCS{X,Y) 
correctly as the original algorithm of Iliopoulos and Rahman. It is obvious that the computation of list 
Ly{c),c G S, costs 0(n) time and space. For each match (i,J) G M, the algorithm executes each of the 
4 operations Succ, Succ^Succ, and Succ at most once, and each of the 4 operations costs O(loglogn). 
Therefore, the total time spent for these operations is 0(i? log log n). The worst case time complexity of the 
algorithm is therefore 0(i? log log n + n). 

To maintain the van Emde Boas tree, 0(n) space is sufficient. To reconstruct the LCS{X,Y), the 
algorithm uses two vectors B and C, both of size R. The space required by the algorithm is thus 0(R). The 
worst case space complexity of the algorithm is therefore 0{R). 

The proof is completed. □ 

If we chose a balanced binary search tree such as red-black tree as our data structure for the ordered 
positive integer set S', the following dynamic set operations. Search, Insert, Delete, Min, Max, Succ, and 
Pred can be implemented in 0(log |S|) time. In this case. The time complexity of our algorithm becomes 
0(i?logL), since |S| < L. We then can find that the following theorem holds. 

Theorem 2 The longest common subsequence problem can be solved in 0{R\ogL + n) time and 0{R) space 
in the worst case, where n,L and R are the length of input sequences X and Y, the length of LCS{X,Y), 
and the total number of matches for X and Y, respectively. 

The ordered positive integer set S in our algorithm can also be efficiently purported by an ordered vector 
s of length at most L as follows. 

In the above algorithm, for each row i, all columns j G Lyixi) are processed in a decreasing order. The 
successors of all j — 1 in s are searched and updated also in a decreasing order from the largest element 
s(a). It is obvious that the time spent for each row i is 0{a). Therefore, we can conclude that the time 
complexity of the above algorithm is 0(nL), since a < L. 










Algorithm 6 V-LCS 
1: O i — 0, k i — —1 
2: for i = 1 to n do 
3: for all j € Lyixi) do 

4: while k > 0 and s{k) > j do 

5: k k — 1 

6: end while 

7: s{k + 1) ^ j 

8: if fc = a then 

9: CX i — 1 “t- O 

10: end if 

11 : k a 

12: end for 

13: end for 
14: return a 


Theorem 3 The longest common subsequence problem can be solved in 0{nL) time and 0(R) space in 
the worst case, where n and L are the length of input sequences X and Y, and the length of LCS(X,Y), 
respectively. 
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